A comparative study on rough set based class imbalance learning

نویسندگان

  • Jinfu Liu
  • Qinghua Hu
  • Daren Yu
چکیده

This paper performs systematic comparative studies on rough set based class imbalance learning. We compare the strategies of weighting, re-sampling and filtering used in the rough set based methods for class imbalance learning. Weighting is better than re-sampling, and re-sampling is better than filtering. The weighted rough set based method achieves the best performance in class imbalance learning. Furthermore, we compare various configurations of the weighted rough set based method. The weighted rule extraction and weighted decision have greater influence on the performance of the weighted rough set based method than the weighted attribute reduction. The weighted attribute reduction based on the weighted degree of dependency, the rule extraction for the exhaustive set of rules and the weighted decision based on the majority voting of the factor of weighted strength are the optimal configurations for class imbalance learning. Finally, we compare the weighted rough set based method with the decision tree and SVM based methods. The experimental results show that the weighted rough set based method outperforms the decision tree and SVM based methods. It can be concluded from the comparisons that the weighted rough set based method is effective for class imbalance learning. 2008 Elsevier B.V. All rights reserved.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A weighted rough set based method developed for class imbalance learning

In this paper, we introduce weights into Pawlak rough set model to balance the class distribution of a data set and develop a weighted rough set based method to deal with the class imbalance problem. In order to develop the weighted rough set based method, we design first a weighted attribute reduction algorithm by introducing and extending Guiasu weighted entropy to measure the significance of...

متن کامل

Weighted Rough Set Learning: Towards a Subjective Approach

Classical rough set theory has shown powerful capability in attribute dependence analysis, knowledge reduction and decision rule extraction. However, in some applications where the subjective and apriori knowledge must be considered, such as cost-sensitive learning and class imbalance learning, classical rough set can not obtain the satisfying results due to the absence of a mechanism of consid...

متن کامل

Fuzzy rough classifiers for class imbalanced multi-instance data

In multi-instance learning, each learning object consists of many descriptive instances. In the corresponding classification problems, each training object is labeled, but its constituent instances are not. The classification objective is to predict the class label of unseen objects. As in traditional single-instance classification, when the class sizes of multi-instance data are imbalanced, cl...

متن کامل

T-Rough Sets Based on the Lattices

The aim of this paper is to introduce and study set- valued homomorphism on lattices and T-rough lattice with respect to a sublattice. This paper deals with T-rough set approach on the lattice theory. The result of this study contributes to, T-rough fuzzy set and approximation theory and proved in several papers. Keywords: approximation space; lattice; prime ideal; rough ideal; T-rough set; set...

متن کامل

The Comparative Effect of Using Idioms in Conversation and Paragraph Writing on EFL Learners’ Idiom Learning

This study investigated the comparative effect of teaching idiomatic expressions through practicing them in conversation and paragraph writing on intermediate EFL learners’ idiom learning. The participants were sorted out of a population of 134 intermediate students in Zabansara Language School in Khorramabad based on their scores on a Preliminary English Test (PET) and an idiom test piloted in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Knowl.-Based Syst.

دوره 21  شماره 

صفحات  -

تاریخ انتشار 2008